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GENE SCREENING METHOD USING NUCLEAR RECEPTOR 



Cross Reference to Related Applications 

This application is a continuation-in-part of International Patent Application 
5 No. PCT/JP98/03280, filed July 22, 1998, which claims priority fi-om Japanese Patent 
AppUcation No. JP 09/212624, filed July 22, 1997. 

Technical Field 

This invention relates to a method for screening a compound using the nature of 
1 0 transcriptional regulatory factors, mainly nuclear receptors, and a method for determining the 
compound. 

Specifically, it relates to a method for screening a gene encoding a polypeptide that 
converts a ligand precursor into a hgand, a polypeptide that converts a Ugand precursor 
obtainable by the screening method into a ligand, a gene encoding the polypeptide, and a 

15 method for determining whether or not a test gene encodes a polypeptide that converts a 
ligand precursor into a ligand. In addition, it relates to a method for screening a ligand that 
binds to a nuclear receptor, a ligand obtainable by the screening method, and a method for 
determining whether or not a test compound is a ligand that binds to a nuclear receptor. 
Furthermore, it relates to a method for screening a gene encoding a polypeptide that converts 

20 an inactive form of a transcriptional regulatory factor into an active form. 

Background of the Invention 

la,25-Dihydroxyvitamin D3 (la,25(OH)2D3) (A. W. Nonnan, J. Roth, L. Orchi, 
Endocr. Rev. 3, 331 (1982); H. F. DeLuca, Adv. Exp. Med. Biol. 196, 361 (1986); M. R. 
Walters, Endocr. Rev. 13, 719 (1992)) is a hormone form of vitamin D and the most 
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biologically active natural metabolite. This compound is generated by sequential 

hydroxylation. First, it is hydroxylated in the liver to generate 25-hydroxyvitamin D3 

(25(OH)D3), then subsequently hydroxylated in the kidney to generate la,25(OH)2D3 (H, 

Kawashima, S. Torikai, K. Kurokawa, Proc. Natl. Acad. Sci. USA 78, 1 199 (1981); H. L. 

5 Henry et al., J. Cell. Biochem. 49, 4 (1992)). The transactivation effect of vitamin D receptor 

(VDR) is provoked by the binding of la,25(OH)2D3 to VDR (M. Beato, P. HerrUch, G. 

Schuts, Cell 83, 851 (1995); H. Darwish and H. F. DeLuca, Eukaryotic Gene Exp. 3, 89 

(1993); B. D. Lemon, J. D. Fondell, L. P. Freedman, Mol. Cell. Biol. 17, 1923 (1997)). This 

regulates the transcription of a series of target genes involved in the major functions of 

^ 1 0 vitamin D, such as calciimi homeostasis, cell differentiation, and cell proliferation (D. D. 

J Bikle and S. Pillai, Endoc, Rev. 14, 3 (1992); R. Bouillon, W. H. Okamura, A. W. Norman, 

~ Endoc. Rev. 16, 200 (1995); M. T. Haussler et al., Recent Prog. Horai. Res. 44, 263 (1988); 

5 P. J. Malloy et al., J. Clin. Invest. 86, 2071 (1990)). The importance of the hydroxylation of 

□ 25(OH)D3 in the kidney in the synthesis of active vitamin D has been shown, and it has been 
111 1 5 beUeved for a long time that the hydroxylation is done by 25(OH)D3- 1 a hydroxylase 

□ (la(OH)-ase), which is localized especially at proximal renal tubules. The activity of 
la(OH)-ase is negatively regulated by its final product, la,25(OH)2D3 (Y. Tanaka and H. F. 
DeLuca, Science 183, 1198 (1974); K. Ikeda, T. Shinki, A. Yamaguchi, H. F. DeLuca, K. 
Kurokawa, T. Suda, Proc. Natl. Acad. Sci. USA 92, 61 12 (1995); H, L. Henry, R. J. Midgett, 

20 A. W. Norman, J. Biol. Chem. 249, 7584 (1 974)), and positively regulated by peptide 

hormones like calcitonin and PTH, which participate in calciimi regulation (H. Kawashima, 
S. Torikai, K. Kurokawa, Nature 291, 327 (1981); K. W. Colston, L. M. Evans, L. Galauto, 
L. Macintyre, D. W. Moss, Biochem. J. 134, 817 (1973); D. R. Fraser and E. Kodicek, 
Nature 241, 163 (1973); M. J. Beckman, J. A. Johnson, J. P. Goflf, T. A. Reinhardt, D. C. 
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Beitz, R. L. Horst, Arch. Biochem. Biophys. 319, 535 (1995)). The complicated regulation 

of the la (OH)-ase activity by these hormones maintains the serum concentration of 

la,25(OH)2D3 at a certain level. The mutation of the la(OH)-ase gene may causes a genetic 

disease, vitamin D-dependent type I rickets (D. Fraser, S. W. Kooh, H. P. Kind, M. F. 

5 HoUick, Y. Tanaka, H. F. DeLuca, N. Engl. J. Med. 289, 817 (1973); S. Balsan, in Rickets, F. 

H, Glorieux, Ed. (Raven, New York, 1991), pp 155-165), which also demonstrate the 

importance of the enzyme in vivo in the function of vitamin D. The biochemical analysis of 

partially purified la(OH)-ase protein strongly suggested that this enzyme belongs to P450 

family (S. Wakino et al., Gerontology 42, 67 (1996); Eva Axen, FEBS Lett. 375, 277 (1995); 

% 10 M. BurgosTrinidad, R. Ismaii, R. A. Ettinger, J. M. Prahl, H. F. DeLuca, J. Biol. Chem. 267, 

;| 3498 (1992); M. Warner et al., J. Biol. Chem. 257, 12995 (1982)). However, the 

biochemical characteristics of the enzyme and the molecular mechanism of the negative 

li feedback by la,25(OH)2D3 are not well understood since the enzyme purification is difficult 

i;3 and cDNA has not been cloned yet. Thus, the cDNA cloning of the enzyme had been 

ill 15 desired. Recently, the cloning of the rat enzyme that hydroxylates the la position of 

O vitamin D has been reported (J. Bone Min. Res. Vol. 1 1 (suppl) 117 (1996)). 

Conventionally, methods depend on phosphorylation of intracellular signal 

transduction factors or ion channels of membrane receptors as criteria have mainly been 

employed to screen genes encoding polypeptides that act on a specific nuclear receptor 

20 directly or indirectly, including la (OH)-ase mentioned above. Specifically, expression 

vectors into which a cDNA library or cDNA is inserted are introduced into cells or haploid 

individuals, for example Xenopus oocytes, and then phosphorylation, cell growth and the 

change in the electric current has been monitored for the screening. 



3 
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However, it has been very difficult to isolate genes by using these methods. 

Especially, highly sophisticated techniques are required for the expression cloning of an 

enzyme itself because the indicators available for the detection are limited. Therefore, the 

development of a simple and efficient screening method has been desired. 

5 Summary of the Invention 

An objective of the present invention is to provide a simple and efficient method for 
screening a gene encoding a polypeptide that converts a ligand precursor into a ligand, and a 
method for determining whether or not a test gene encodes a polypeptide that converts a 
Ugand precursor into a Hgand. Another objective of the present invention is to provide a 

1 0 method for isolating a polypeptide that converts a ligand precursor into a ligand and a gene 
encoding it, using the screening method. 

Furthermore, an objective of the invention is to provide a method for screening a 
hgand that binds to a nuclear receptor, a method for determining whether or not a test 
compound is a Ugand for a nuclear receptor, and a method for screening a gene encoding a 

1 5 polypeptide that converts an inactive form of a transcriptional regulatory factor into an active 
form, based on the screening method and the determination method described above. 

The present inventors investigated to achieve the above objectives and focused on the 
nature of nuclear receptors, which function as transcriptional regulatory factor by being 
bound by a hgand. We successfully constructed the system in which a hgand is formed by 

20 the expression of a polypeptide that converts a ligand precursor into a Ugand, and the Ugand 
thus formed binds to a nuclear receptor to thereby induce the expression of a reporter gene 
located downstream of the target sequence. We searched a gene library using this system 
and succeeded in isolating a gene encoding a polypeptide capable of converting a Ugand 
precursor into a ligand. 

4 
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Specifically, the inventors constracted a vector comprising a gene encoding a fusion 
polypeptide of DNA binding domain of GAL4 and ligand-binding domain of vitamin D 
receptor and a vector in which the lacZ gene, a reporter, is located downstream of the binding 
sequence of the DNA binding domain of GAL4. These two vectors, and subsequently the 
cDNA library, were introduced into cells. Then the reporter activity was measured after 
adding the vitamin D precursor Clones with the reporter activity were selected to examine 
whether or not they have the activities to convert the vitamin D precursor into vitamin D, 
thereby finding a clone that has the activity. 

Also, the inventors found that this system, which takes the advantage of the 
transcriptional regulatory fimction of a nuclear receptor, makes it possible to screen a hgand 
that binds to a nuclear receptor and to examine whether or not a test compound is a ligand 
that binds to the nuclear receptor. Specifically, for example, a library of test compoimds can 
be used in place of a ligand precursor and a gene library comprising the gene encoding a 
polypeptide that converts a precursor into a ligand in the system described above. When a 
test compound functions as a Hgand, the nuclear receptor promotes the transcription of the 
reporter gene. Thus, compounds that fimction as ligands can be screened firom the library 
simply by detecting the reporter activity as an index. 

Furthermore, the inventors found that the system utiUzing the transcriptional 
regulatory fimction of a nuclear receptor can be employed to screen genes that encode 
polypeptides capable of converting an inactive form of a wide range of transcriptional 
regulatory factors into an active form. In other words, the inventors found that the system in 
which the transcriptional regulatory fimction can be used to isolate factors involved in 
activation of various transcriptional regulatory factors, which have inactive and active forms. 
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such as transcriptional regulatory factors activated by phosphorylation as well as nuclear 

receptors activated by the binding of ligands. 

More specifically, this invention relates to: 

1 . a cell comprising a vector carrying a gene encoding a nuclear receptor and a vector 
carrying the binding sequence of the nuclear receptor and a reporter gene located 
downstream of said binding sequence; 

2. the cell of 1, wherein the nuclear receptor is a vitamin D receptor; 

3. a cell comprising a vector carrying a gene encoding a fusion polypeptide 
comprising DNA binding domain of a nuclear receptor and ligand-binding domain of a 
nuclear receptor, and a vector carrying the binding sequence of the DNA binding domain of 
the nuclear receptor and a reporter gene located downstream of the binding sequence; 

4. the cell of 3, wherein the DNA binding domain of the nuclear receptor is 
originated fi-om GAL4; 

5. the cell of 3, wherein the Ugand-binding domain of the nuclear receptor is 
originated from vitamin D receptor; 

6. a method for screening a ligand that binds to a nuclear receptor, the method 
comprising 

(A) contacting a test compound with the cell of any one of 1 to 5, 

(B) detecting the reporter activity, and 

(C) selecting the test compound which elicited the reporter activity in the cell; 

7. a method for determining whether or not a test compound is a ligand that binds to 
a nuclear receptor, the method comprising, 

(A) contacting a test compound with any one of the cell of 1 to 5, and 

(B) detecting the reporter activity; 
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8. a method for screening a gene encoding a polypeptide that converts a ligand 
precursor into a ligand, the method comprising 

(A) introducing a test gene into any one of the cell of 1 to 5, 

(B) contacting a ligand precursor to the cell into which the test gene is 
introduced, 

(C) detecting the reporter activity, and 

(D) isolating the test gene from the cell which showed the reporter activity; 

9. a method for determining whether or not a test gene encoding a polypeptide that 
converts a ligand precursor into a ligand, the method comprising 

(A) introducing a test gene into any one of the cell of 1 to 5, 

(B) contacting a ligand precursor to the cell into which the test gene is 
introduced, and 

(C) detecting the reporter activity; 

10. a method for screening a gene encoding a polypeptide that converts an inactive 
form of vitamin D3 into an active form, the method comprising 

(A) introducing a test gene into the cell of 2 or 5, 

(B) contacting an inactive form of vitamin D3 to the cell into which the test gene 
is introduced, 

(C) detecting the reporter activity, and 

(D) isolating the test gene from the cell that shows the reporter activity; 

1 L a method for determining whether or not a test gene encodes a polypeptide that 
converts an inactive form of vitamin D3 into an active form, the method comprismg 
(A) introducing a test gene into the cell of 2 or 5, 
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(B) contacting an inactive form of vitamin D3 with the cell into which the test 
gene is introduced, and 

(C) detecting the reporter activity; 

12. a ligand that binds to a nuclear receptor, which is obtainable by the method of 
claim 6; 

13. a gene encoding a polypeptide that converts a ligand precursor into a ligand, 
which is obtainable by the method of claim 8, 

14. a gene encoding a polypeptide that converts an inactive form of vitamin D3 into 
an active form, which is obtainable by the method of claim 10, 

15. a polypeptide comprising the amino acid sequence of SEQ ED NO: 1 or its 
derivative comprising said sequence in which one or more amino acids are substituted, 
deleted, or added, and having activity to convert an inactive form of vitamin D3 into an active 
form; 

16. a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or its 
derivative comprising said sequence in which one or more amino acids are substituted, 
deleted, or added, and having activity to convert an inactive form of vitamin D3 into an active 
form; 

17. a polypeptide encoded by a DNA that hybridizes with a DNA having the 
nucleotide sequence of SEQ ID NO: 3, wherein the polypeptide has activity to convert an 
inactive form of vitamin D3 into an active form; 

1 8. a polypeptide encoded by a DNA that hybridizes with the nucleotide sequence of 
SEQ ID NO: 4, wherein the polypeptide has activity to convert an inactive form of vitamin 
D3 into an active form; 

19. a DNA encoding any one of the polypeptide of 15 to 18; 

8 
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20. a DNA hybridizing with a DNA having the nucleotide sequence of SEQ E) 
NO: 3 and encoding a polypeptide having activity to convert an inactive form of vitamin D3 
into an active form; 

2 1 . a DNA hybridizing with a DNA having the nucleotide sequence of SEQ ID 
NO: 4 and encoding a polypeptide having activity to convert an inactive form of vitamin D3 
into an active form; 

22. a vector comprising any one of the DNA of 19 to 21 ; 

23. a transformant expressively retaining any one of the DNA of 19 to 21 ; 

24. a method for producing any one of the polypeptide of 15 to 18, the method 
comprising culturing the transformant of 23; 

25. an antibody that binds to any one of the polypeptide of 15 to 18; 

26. a method for screening a gene encoding a polypeptide that converts an inactive 
form of transcriptional regulatory factor into an active form, the method comprising 

(A) introducing a test gene into cells into which a vector comprising a gene 
encoding an inactive form of transcriptional regulatory factor and a vector comprising the 
binding sequence of said inactive transcriptional regulatory factor and a reporter gene located 
downstream thereof are introduced, 

(B) detecting the reporter activity, and 

(C) isolating the test gene from the cells showing the reporter activity; 

27. a method of 26, wherein the inactive transcriptional regulatory factor is a 
complex of non-phosphorylated NFkB and IkB, non-phosphorylated HSTF, or non- 
phosphorylated API. 
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The term "ligand" used herein means a compound that binds to a nuclear receptor and 
regulates the transcriptional activating abiUty of a target gene of the nuclear receptor. It 
includes not only naturally-occurring compounds but also synthetic compounds. 

The term "nuclear receptor" used herein means a factor that binds to an upstream site 
of a promoter of a target gene and hgand-dependently regulates transcription. 

The "polypeptide that converts a ligand precursor into a Ugand" includes a 
polypeptide that acts directly on a Ugand precursor to convert it into a ligand. It also includes 
a polypeptide that indirectly converts a ligand precursor into a ligand, for example, a 
polypeptide activating a polypeptide that directly acts on a ligand precursor to convert it into 
a Hgand. 

An "isolated nucleic acid" is a nucleic acid the structure of which is not identical to 
that of any naturally occurring nucleic acid or to that of any fragment of a naturally occurring 
genomic nucleic acid spanning more than three separate genes. The term therefore covers, 
for example, (a) a DNA which has the sequence of part of a naturally occurring genomic 
DNA molecule but is not flanked by both of the coding sequences that flank that part of the 
molecule in the genome of the organism in which it naturally occurs; (b) a nucleic acid 
incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner 
such that the resulting molecule is not identical to any naturally occurring vector or genomic 
DNA; (c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by 
polymerase chain reaction (PGR), or a restriction fragment; and (d) a recombinant nucleotide 
sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein. Specifically 
excluded from this definition are nucleic acids present in mixtures of different (i) DNA 
molecules, (ii) transfected cells, or (iii) cell clones: e.g., as these occur in a DNA library 
such as a cDNA or genomic DNA library. 

10 
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The term "substantially pure" as used herein in reference to a given polypeptide 
means that the polypeptide is substantially free from other biological macromolecules. The 
substantially pure polypeptide is at least 75% (e.g., at least 80, 85, 95, or 99%) pure by dry 
weight. Purity can be measured by any appropriate standard method, for example, by 
5 column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 

The "transcriptional regulatory factor" used herein means a factor that binds to an 
upstream site of a promoter of a target gene and regulates transcription of the target gene. 
The above-described nuclear receptor is included in the transcriptional regulatory factor of 
the invention. 

10 The "polypeptide that converts an inactive form of transcriptional regulatory factor 

into an active form" used herein includes not only a polypeptide that acts directly on an 
inactive form of transcriptional regulatory factor to convert it into an active form but also a 
polypeptide that indirectly converts an inactive form to an active form. When an inactive 
form of transcriptional regulatory factor is converted into an active form by phosphorylation, 

15 the transcriptional regulatory factor of the invention includes a polypeptide that activates a 
polypeptide phosphorylating the inactive form and indirectly converts the inactive form into 
the active form as well as a polypeptide directly involved in the phosphorylation. 

The first aspect of the present invention relates to a method for screening a gene 
encoding a polypeptide that converts a ligand precursor into a ligand, and a method for 

20 determining whether or not a test gene encodes a polypeptide that converts a lig^d precursor 
into a ligand. In these methods, a vector carrying a gene encoding a nuclear receptor 
(expression unit 1), and a vector carrying the binding sequence of the nuclear receptor and a 
reporter gene located downstream thereof (expression unit 2) are introduced into cells. Then, 
a test gene is introduced into the cells. 
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The "gene encoding a nuclear receptor" in the expression unit 1 is not particularly 
limited and any nuclear receptor gene can be used. For example, when orphan receptors such 
as PPAR, LXR, FXR, MB67, ONR, NUR, COUP, TR2, HNF4, ROR, Rev-erb, ERR, Ftz-Fl, 
Tlx and GCNF (Tanpakusitsu Kakusan Koso (Protein, Nucleic Acid, Enzyme) Vol. 41 No. 8 
pl265-1272 (1996)) are used as the nuclear receptor in the below-mentioned screening of 
unknown Hgands that bind to nuclear receptors or determination whether or not a test 
compound is a ligand binding to a nuclear receptor, the naturally-occurring or synthesized 
ligand can be detected and isolated. Furthermore, nuclear receptors for which the ligand and 
Ugand precursor are known, such as VDR (vitamin D receptor), ER, AR, GR, MR 
(Tanpakusitsu Kakusan Koso (Protein, Nucleic Acid, Enzyme) Vol. 41 No. 8 pl265-1272 
(1996)) are preferably used in the below-mentioned screening of genes encoding 
polypeptides that convert a hgand precursor into a Ugand or the determination whether or not 
a test gene encodes a polypeptide that converts a hgand precursor into a hgand. However, 
nuclear receptors used in the present invention are not limited thereto. 

In the present invention, the nuclear receptor gene can be used alone, and a fusion 
polypeptide gene comprising the DNA binding domain of a nuclear receptor and the hgand- 
binding domain of another nuclear receptor can also be used. For example, the DNA binding 
domain of GAL4 is preferably used as the DNA binding domain because it enhances the 
expression of the reporter gene downstream thereof 

The "binding sequence of a nuclear receptor" in the expression unit 2 varies 
depending on the nuclear receptor. In most nuclear receptors, sequences comprising 
"AGGTCA" are usually used. In the case of a dimeric nuclear receptor, the binding 
sequence is preferably composed of two repetition of the sequence. The repetitive sequences 
include the direct-repeat type, in which the two sequences are aUgned in the same direction, 

12 
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and the palindrome type, in which the sequences are directed to the center (Tanpakusitsu 

Kakusan Koso (Protein, Nucleic Acid, Enzyme) Vol. 41 No. 8 pl265-1272 (1996)). A 

spacer sequence usually exists between the repetition sequences, which can determine the 

specificity of the nuclear receptor (K. Umesono et al, Cell Vol. 65, pl255-1266 (1991)). 

A reporter gene located downstream of a nuclear receptor is not particularly limited. 

Preferable reporter genes are, for example, lacZ, CAT, and luciferase. Resistant genes to 

toxins or antibiotics, such as ampicillin resistant gene, tetracycline resistant gene, kanamycin 

resistant gene, can also be used to select cells by applying the corresponding toxin or 

antibiotic. 

The binding sequence of a nuclear receptor and the reporter gene are not necessarily 
connected directly. Some sequences that alter the strength of the promoter, for example, the 
promoter region of -globin, can be inserted between the binding sequence and the reporter 
gene. 

Animal cells are preferable for introducing these expression units. Cells with high 
transformation efficiency such as COS-1 cells and HeLa cells are particularly preferable. 
Vectors for animal cells such as "pcDNA3" (Invitrogen) are preferred to construct expression 
units. Vectors can be introduced into host cells by a known method such as calcium 
phosphate method, lipofection method, electroporation method and the like. 

A test gene is introduced into cells thus prepared. A test gene is not particularly 
limited, and any genes whose capability of converting a ligand precursor into a ligand is 
detected can be used. Genes are screened from cells or cDNA libraries prepared from 
mRNA isolated from tissues or the like, which are expected to express an objective gene. 
For example, a gene encoding a polypeptide that converts vitamin D precursor into active 
vitamin D can be screened from a cDNA library derived from kidney or the like. In this case, 

13 
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a vector expressing adrenodoxin (ADX) and an vector expressing adrenodoxin reductase 

(ADR) are preferably introduced into cells together with a test gene so as to efficiently 

generate active vitamin D. A test gene can be inserted into an appropriate vector and 

introduced into cells. For example, preferable vectors are 'pcDNAS' (Invitrogen) mentioned 

5 above or the Uke. 

Next, cells into which a test gene is introduced are contacted with a hgand precursor. 
As the hgand precursor, the one that acts on a nuclear receptor expressed by the expression 
unit 1 mentioned above is usually used. Examples of the hgand precursor include, without 
hmitation, 25-hydroxyvitamin D3, a precursor of VDR ligand (active vitamin D, 

1 0 1 a,25(OH)2D3); testosterone, a precursor of ER hgand (estrogen) and AR Ugand 
(dihydroxytestosterone); 1 1-deoxycortisol, a precursor of GR ligand (Cortisol), 
corticosterone, a precursor of MR hgand (aldosterone), etc. The contact of the ligand 
precursor with the cells can be performed by adding the hgand precursor to the culture 
medium of the cells, or a similar method, 

1 5 The reporter activity is then detected. If a test gene that is introduced mto cells 

encodes a polypeptide that converts a hgand precursor into a ligand, the hgand generates 
from the hgand precursor contacted with the cells, and binds to the nuclear receptor to make 
a ligand- nuclear receptor complex, which then binds to its target sequence to express the 
reporter gene. If the test gene does not encode a polypeptide that converts a ligand precursor 

20 into a ligand, the hgand is not produced from the hgand precursor and thus the reporter gene 
is not expressed. In this way, detecting the reporter activity enables judging whether or not 
the test gene encodes a polypeptide that converts a ligand precursor into a ligand. The 
reporter activity can be detected by a method weU known in the art using criteria such as 
staining, fluorescence, or ceU viability, depending on the reporter gene. 

14 
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When a gene library or the Hke is used instead of a single gene, cells are selected by 
the reporter activity to isolate the test gene. The test gene can be extracted from cells by, for 
example, the method described in H. S. Tong et al, Journal of Bone and Mineral Research 
Vol 9, 577-584 (1994). The primary structure of the gene extracted can be determined by a 
known method such as dideoxy method. 

The cells into which expression units 1 and 2 are introduced can be used for screening 
genes encoding polypeptides capable of converting a ligand precursor into a Hgand or 
determining whether or not a test gene encodes a polypeptide that converts a ligand precursor 
into a ligand. Furthermore, the cells can be used for screening ligands that bind to a nuclear 
receptor or determining whether or not a test compound is a Ugand that binds to a nuclear 
receptor. Specifically, a candidate for a ligand that acts on a nuclear receptor (a single test 
compound or a library of test compounds) is used instead of a ligand precursor and a 
candidate for a gene encoding a polypeptide that converts the Ugand precursor into the ligand 
(a single candidate gene, gene libraries, etc.). When a test compound functions as a Hgand, a 
complex of a nuclear receptor and the test compound (hgand) activates the reporter located 
downstream of the target sequence and thus whether or not the test compound function as a 
ligand can be judged. Furthermore, compounds that function as ligands can be screened from 
plural compounds by detecting the reporter activity. 

The inventors screened genes encoding polypeptides capable of converting the 
vitamin D precursor into active vitamin D as an example of the screening of genes encoding 
enzymes capable of converting a Ugand precursor into a Ugand, and obtained a desired gene. 
The present invention also relates to a polypeptide that converts the vitamin D precursor into 
active vitamin D and a gene encoding it. 
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Polypeptides derived from mouse and human that convert the vitamin D precursor 

into active vitamin D, which are encompassed by the polypeptides of the present invention, 

are shown in SEQ ID NO: 1 and SEQ ED NO: 2, respectively. Vitamin D is first 

hydroxylated in the liver to generate 25(OH)D3, then hydroxylated in the kidney to generate 

la,25(OH)2D3. The polypeptide of the present invention converts 25(OH)D3 into 

la,25(OH)2D3 by hydroxylation, namely hydroxylates the la position of vitamin D 

(la (OH)-ase). 

The polypeptide of the present invention can be a naturally-occurring protein. 
Alternatively, it can be prepared as a recombinant polypeptide by gene recombination 
techniques. Both are included in the polypeptide of the present invention. A naturally- 
occurring protein can be isolated by methods well known in the art, for example, from kidney 
cell extract by affinity chromatography using an antibody binding to the polypeptide of the 
present invention as described below. On the other hand, a recombinant protein can be 
prepared by culturing cells transformed with a DNA encoding the polypeptide of the present 
invention as described below. 

In addition, those skilled in the art can prepare polypeptides with substantially the 
same biological activity as the polypeptide set forth in SEQ ID NO: 1 (or SEQ ID NO: 2) by 
substituting amino acid(s) of the polypeptide or the like known method. The mutation of 
amino acids can occur spontaneously. The polypeptide of the present invention also includes 
the mutants of the polypeptide set forth in SEQ ID NO: 1 (or SEQ ID NO: 2) whose amino 
acid(s) are modified by substitution, deletion or addition, and which possesses the activity to 
convert the inactive form of vitamin D3 into the active form. A "conservative amino acid 
substitution" is one in which an amino acid residue is replaced with another residue having a 
chemically similar side chain. Families of amino acid residues having similar side chains 
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have been defined in the art. These famiUes include amino acids with basic side chains (e.g., 
lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged 
polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), 
nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, 
5 methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and 
aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). The known 
method of modifying aa amino acid sequence is, for example, the method described in the 
literature, "Shin Saiboukougaku Jikken Protocol, Ed. Department of Oncology, The Institute 
of Medical Science, The University of Tokyo, p241-248." Mutations can be introduced by 
C3 10 using commercially available 'QuickChange Site-Directed Mutagenesis Kit' (Stratagene). 
% It is a routine for those skilled in the art to prepare probes based on the entire or the 

partial nucleotide sequence of SEQ ID NO: 3 encoding the mouse polypeptide or SEQ ID 
m NO: 4 encoding the human polypeptide, isolate DNAs with high homology with the probes 

n from other species, and obtain polypeptides having the activities substantially equivalent to 

i IJ 1 5 those of the polypeptide of the present invention using a known method such as hybridization 
Q technique (K. Ebihara et al., Molecular and Celhilar Biology, Vol. 9, 577-584 (1994)) or 

polymerase chain reaction technique (S. Kitanaka et al.. Journal of Clinical Endocrinology 
and MetaboUsm, Vol. 82, 4054-4058 (1997)). Therefore, the polypeptides of the present 
invention include those encoded by DNAs that hybridize under stringent conditions with the 
20 DNA having the nucleotide sequence of SEQ ID NO: 3 or SEQ ID NO: 4, and having the 
activity to convert an inactive form of vitamin D3 into an active form. By "stringent 
conditions" is meant hybridization at 37°C, 1 X SSC, followed by washing at 42°C, 0.5 X 
SSC, Animal species used for isolating DNAs hybridizing with the DNA having the 
nucleotide sequence of SEQ ID NO: 3 or SEQ ID NO: 4 include rat, monkey, etc. DNAs 
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encoding polypeptides with biological activities substantially equivalent to those of the 

polypeptide set forth in SEQ ID NO: 1 or SEQ ID NO: 2 usually have high homology with 

the DNA set forth in SEQ ID NO: 3 or SEQ ID NO: 4. The "high homology" means 

sequence identity of 70% or more, preferably 80% or more, and more preferably 90% or 

5 more. The "percent identity" of two amino acid sequences or of two nucleic acids is 

determined using the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264- 

2268, 1990), modified as in Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 

1993). Such an algorithm is incorporated into the NBLAST and XBLAST programs of 

Altschul et al. (J. Mol. Biol. 215:403-410, 1990). BLAST nucleotide searches are performed 

10 with the NBLAST program, score = 100, wordlength =12. BLAST protein searches are 
performed with the XBLAST program, score = 50, wordlength = 3. Where gaps exist 
between two sequences. Gapped BLAST is utihzed as described in Altschul et al. (Nucleic 
Acids Res. 25:3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) are used. See 

1 5 http ://www.ncbi.nhn.nih.gov. 

Another aspect of the present invention relates to a DNA encoding the polypeptide of 
the present invention described above. The DNA of the present invention can be cDNA, 
genomic DNA, or synthetic DNA. It can be used not only to isolate a polypeptide with 
activities substantially equivalent to those of the polypeptide of the present invention from 

20 other species, but also to produce the polypeptide of the present invention as a recombinant 
polypeptide. Specifically, the DNA encoding the polypeptide of the present invention, for 
example, the DNA set forth in SEQ ID NO: 3 or SEQ ID NO: 4, is inserted into an 
appropriate vector, which are introduced into appropriate cells. The transformant cells are 
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cultured to express the polypeptide, and the recombinant polypeptide is purified from the 

culture. 

The cells used to produce the recombinant polypeptide include, for example, 
Escherichia coli and mammahan cells. The vectors used for expressing the recombinant 
polypeptide in the cells vary depending on host cells. For example, pGEX (Pharmacia) and 
pET (Novagen) are suitably used for coli, and pcDNAS (Invitrogen) is used suitably for 
animal cells. These vectors can be introduced into the host cells by heat-shock, for example. 
The recombinant polypeptide can easily be purified from the transformant by glutathione- 
Sepharose affinity chromatography when pGEX (Pharmacia) is used, and by nickel-agarose 
affinity chromatography when pET (Novagen) is used. 

Those skilled in the art can readily raise antibodies that bind to the polypeptide of the 
invention using the polypeptide prepared as described above. The polyclonal antibodies of 
present invention can be prepared by a well known method. For example, the polypeptide is 
injected into a rabbit or the like and IG fraction is purified by ammonium sulfate 
precipitation. Monoclonal antibodies can be produced by preparing hybridoma from spleen 
cells of mice immunized with the polypeptide of the present invention and myeloma cells and 
culturing the hybridoma to secrete the monoclonal antibody in the culture medium, 
intraperitoneally injecting the antibody obtained into an animal to obtain a large quantity of 
the antibody. 

The polypeptides, DNA, and antibodies of the present invention can be applied as 
follows. The polypeptides and DNA of the present invention can be used for therapy and/or 
diagnosis of patients with low la (OH)-ase activity, such as patients with defects in 
la (OH)-ase or renal failure. The present inventors have identified the mutation of the DNA 
of the present invention in vitamin D-dependent type I rickets case, specifically, P382S 
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(mutation from CCT to TCT), R335P (mutation from CGG to CCG), G125E (mutation from 
GGA to GAA), R107H (mutation from CGC to CAC). The present invention is also 
applicable to treat these patients. The mutations in the patients can be identified by 
extracting DNA from peripheral leukocytes of a patient, ampUfying the DNA by PGR using 
5 the primer in which each exon is set as intron, and determining the nucleotide sequence or 
the DNA by direct sequencing method. The DNA of the present invention can be used in 
gene therapy. In this case, the DNA of the invention is inserted into an appropriate vector, 
and the vector is introduced into the body in vivo or ex vivo, using retrovirus method, 
liposome method, or adenovirus method. The polypeptides of present invention can be used 
10 as an immobilized enzyme to produce active vitamin D derivatives, that is, hydroxylate la 

% position of vitamin D or its derivatives without a hydroxyl group at la position. 

{2 Furthermore, the antibodies of the present invention can be used for therapy of such as 

rg vitamin D excessiveness, granulomatous diseases, and lymphoma as well as purification of 

Q the polypeptides of present invention. 

ry 15 The inventors also enabled screening genes encoding a polypeptide enable of 

;3 converting an inactive form of various transcriptional regulatory factors into an active form 

using the above-described screening system of Ugands binding to nuclear receptors. 
Therefore, the present invention also relates to a method for screening a gene encoding a 
polypeptide that converts an inactive form of a transcriptional regulatory factor into an active 
20 form. 

There are several reports on the mechanism of the conversion of a transcriptional 
regulatory factor into its active form. For example, NFkB, a tissue specific factor, is bound 
to a factor named IkB in the cytoplasm. When it is treated with TP A, IkB dissociates, and 
NFkB translocates into a nucleus. Considering the effect of TPA treatment, the 
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phosphorylation by protein kinase C is probably involved in the conversion of NFkB into an 
active form. In the case of HSTF, its phosphorylation level is low before the heat-shock, and 
is high after the heat-shock. This indicates that the phosphorylation is involved in the 
conversion of HSTF into its active form. Phosphorylation is also considered to be involved 
5 in the conversion of API into its active form, 

GAL4 is an inactive form when GAL80 binds thereto before the induction by 
galactose. After the induction by galactose, the complex dissociates and GAL4 becomes an 
active form. Hsp90 binds to a glucocorticoid receptor before the hormone induction. After 
the induction, the complex dissociate to form an active form of glucocorticoid receptor 
^5 1 0 (Jikken Igaku (Experimental Medicine) Vol. 7, No.4 (1 989)). 

The "polypeptides that convert an inactive form of a transcriptional regulatory factor 
2 into an active form" used herein includes polypeptides functioning in activation of 

m transcriptional regulatory factors by dissociation of inhibitory factors, or by its quaUtative 

alteration, such as phosphorylation. The "inactive form of a transcriptional regulatory factor" 
ry 1 5 include, for example, a complex of non-phosphorylated NFkB and IkB, non-phosphorylated 
HSTF, non-phosphorylated API, as described above, but is not limited thereto. 

In this screening method, a gene encoding an inactive form of a transcriptional 
regulatory factor, instead of a nuclear receptor gene, is introduced into a vector to construct 
the "expression unit 1" described above, and a vector into which the binding sequence of the 
20 transcriptional regulatory factor and a reporter gene downstream thereof is constructed as the 
"expression unit 2." The expression units are introduced into cells, and a test gene is 
introduced into the cells. If the test gene introduced has activity to convert an inactive form 
of the transcriptional regulatory factor into an active form, the inactive transcriptional 
regulatory factor, which is the product of the expression unit 1, is converted into the active 
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form, and then active transcriptional regulatory factor binds to its binding sequence in the 

expression unit 2 to induce expression of the reporter gene. In contrast, when the test gene 

introduced does not have activity to convert an inactive transcriptional regulatory factor into 

an active form, the reporter gene in the expression unit 2 will not be induced. Therefore, one 

can judge whether or not a test gene has activity to convert an inactive transcriptional 

regulatory factor into its active form using the present screening method by detecting the 

reporter activity. 

When a gene library is used as a test gene, one can isolate a gene encoding a 
polypeptide with the activity to convert an inactive form of transcriptional regulatory factor 
into an active form from the library. 

Brief Description of Drawings 

Figure 1 schematically shows the expression cloning system mediated by VDR. 

Figure 2 is a graph showing the serum concentration of la,25(OH)2D3 in 3- and 7- 
week-old VDR+/+, VDR+/- and VDR-/- mice. 

Figure 3 is a micrograph of cells stained with X-gal. (b) presents COS-1 cells 
transformed with a expression cDNA library; (a), negative control; (c), positive control; and 
(d) stained cells with cDNA that was extracted from the positive cells in (b) and amplified by 
PGR. 

Figure 4 shows the putative amino acid sequence of CYPIAD. The first methionine 
is assigned as position 1. Asterisk indicates the terminal codon. Putative mitochondria 
targeting signal is surrounded by square. Underline indicates sterol binding domain. Dotted 
underiine indicates hem-binding domain. 
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Figure 5 shows homology of 'CYPIAD' to rat 25(OH)-ase (CYP27) and mouse 
24(OH)-ase (CYP24). Amino acid sequence homologies in sterol binding domain and hem- 
binding domain are also indicated. 

Figure 6 shows a photograph of 10% SDS-PAGE pattern of CYPl AD protein 
translated in vitro. 

Figure 7 shows the result of CAT assay for detecting in vivo activity of CYPl AD. 
The bottom panel shows a representative CAT assay, and the top panel shows the relative 
CAT activity as average and SEM from three independent experiments. 

Figure 8 shows the normal phase HPLC analysis of 25(OH)D3 metabolites. 

Figure 9 shows the reverse phase HPLC analysis of 25(OH)D3 metaboKtes. 

Figure 10 shows the northem blot analysis for analyzing tissue distribution of 
CYPIAD transcripts. 

Figure 1 1 shows the northem blot analysis of 3- and 7-week-old, VDR+/+, VDR-t-/- 
and VDR-/- mice, with(+) or without(-) overdosage of la525(OH)2D3 (50 ng/mouse). 

Figure 12 shows the relative amount of the hydroxylase gene in 3- or 7-week-old, 
VDR+/+, VDR+/- and VDR-/- mice, with(+) or without(-) overdosage of la,25(OH)2D3 
(50 ng/mouse). 

Detailed Description 

The present invention is demonstrated with reference to examples below, but is not 
to be construed being limited thereto. 
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Example 1 

Isolation ofcDNA encoding an enzyme that hydroxylates la position of 
vitamin D 

The inventors developed an expression cloning system mediated by a nuclear receptor 
5 for cloning a full-length cDNA encoding la (OH)-ase. The system is based on the 

mechanism that 25(OH)D35 a precursor of la,25(OH)2D3, can activate the transactivating 
function of VDR only in the presence of la (OH)-ase (Figure 1). In other words, the ligand- 
dependent transactivating function of VDR (AF-2) is induced by la,25(OH)2D3, but not by 
25(OH)D3. 25(OH)D3 is converted into 1 ,25(OH)2D3 only in cells expressing la 

10 (OH)-ase. Therefore, the cells can be detected by X-gal staining (M. A. Frederick et al., 
Current Protocols in Molecular Biology (Wiley, New York, 1995)) as the result of the 
expression of the lacZ reporter gene in the presence of 25(OH)D3. 

In the kidney of 7-week-old VDR-deficient mice (VDR-/- mice), the serum 
concentration of la,25(OH)2D3 was extremely high (Figure 2), which suggested the high 

15 la (OH)-ase activity. Therefore, the kidney of 7-week-old VDR-/- mice was used to prepare 
an expression library. Poly(A)'^ RNA was purified (K. Takeyama et al, Biochem. Biophys. 
Res. Commun. 222, 395 (1996); H. Mano et al, J. Biol. Chem. 269, 1591 (1994)), and total 
cDNA was prepared from poly(A)^ RNA (U. Gubler and B. J. Hoffinan, Gene 25, 263 
(1983); M. Kobori and H. Nojima, Nucleic Acid Res. 21, 2782 (1993)). The total cDNA was 

20 inserted into the Hindlll position of pcDNA3 (Invitrogen), a expression vector that is derived 
from SV40, functions in mammals, and autonomously replicates in COS-1 cells. The 
reporter plasmid, 17M2-G-lacZ, was constructed by inserting yeast GAL4 (UAS) x 2 and 
p-globulin promoter into the multicloning site of Basic expression vector (Clontech). The 
function of AF-2 induced by a ligand was detected using VDR-ligand-binding domain fused 
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with GAL4-DNA binding domain (VDR-DEF) [GAL4-VDR(DEF)] (K. Ebihara et ah, Mol. 

Cell. Biol. 16, 3393 (1996); T. Imai et al, Biochem. Biophys. Res. Commun. 233, 765 

(1997)). Cos-1 cells cultured in Dulbecco's Modified Eagle's Medium (DMEM) 

supplemented with 10% fetal calf serum were transiently transformed with 0.5 g of GAL4- 

VDR (DEF) expression vector, 1 g of 17M2-G-lacZ, 0.2 g each of ADX expression vector 

and ADR expression vector (T. Sakaki, S. Kominami, K. Hayashi, M. AkiyoshiShibata, Y. 

Yabusaki, J. Biol. Chem. 271, 26209 (1996); F. J. Dilworth et al., J. Biol. Chem. 270, 16766 

(1995)), and 0. 1 g of the expression cDNA library, using Lipofectin (GIBCO BRL). lO'^M 

25(OH)D3 was added to the culture medium 12 hours after the transformation. Cells were 

fixed with 0.05% glutaraldehyde 48 hours after the transformation and were then incubated 

with X-gal at 37°C for 4 hours to identify p-galactosidase positive cells expressing 

la (OH)-ase by X-gal staining (Figure 3(c)) (M. A. Fredrick et al., Current Protocols in 

Molecular Biology (Wiley, New York, 1995)). In the negative control, the expression cDNA 

library was not used (Figure 3(a)). In the positive controls, the expression library was not 

used, and la,25(OH)2D3 was used instead of 25(OH)D3 (Figure 3(b)). 

The stained cells were selectively collected by micromanipulation using a 

micropipette with 40 )Lim diameter under an inverted microscope (H. S. Tong et al., J. Bone 

Miner. Res. 9, 577 (1994)), then transferred into PCR buffer solution. The PCR products 

were electrophoresed on 1% agarose gel, and fi-agments of about 2.0 to 2.5 kb, which is the 

expected cDNA size of the fixU-length 1 (OH)-ase, are pxirified and subcloned into 

pcDNA3. Sequence analysis of cDNA isolated from randomly selected 64 clones showed 

that 13 clones encode completely identical ORF. COS-1 cells into which the single cDNA 

clone was introduced were positive in X-gal staining (Figure 3(d)). 
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The full-length cDNA was obtained by the colony hybridization screening of the 

same library using the cDNA as a probe. The amino acid sequence deduced from ORF is a 

novel polypeptide with 507 amino acids (Figure 4). 

The polypeptide, hereinafter called "CYPl AD," has a mitochondria-targeting signal 

5 and has significant homologies with P450 family members (D. W. Nebert, DNA Cell Biol. 

10, 1 (1991)). Especially, the homology with rat vitamin D3 25-hydroxylase is 41,7% and 

that with mouse 25(OH)D3 24-hydroxylase is 31.6% (Figure 5)(0. Masumoto, Y. Ohyama, 

K. Okuda, J. Biol. Chem. 263, 14256 (1988); E. Usui, M. Noshiro, Y. Ohyama, K. Okuda, 

FEBS Lett. 262, 367 (1990); Y. Ohyama and K. Okuda, J. Biol Chem. 266, 8690 (1991); S. 

10 Itoh et al, Biochem. Biophys. Acta. 1264, 26 (1995)). The homologies for sterol domain, 

15 especially conserved domain, in these enzymes are 93% and 60%, respectively, and those for 

{2 hem binding domain are 70% and 80%, respectively. 

Ill The 10% SDS-PAGE analysis of CYPIAD protein, which was translated in vitro in 

u the presence of [^^S] methionine using Reticulocyte Lysate System (Promega) (H. Sasaki et 

rU 15 al., Biochemistry 34, 370 (1995)) revealed that the molecular weight of the polypeptide is 
O approximately 55 kDa (Figure 6), which is identical to the molecular weight of partially 

purified la (OH)-ase (S. Wakino et al., Gerontology 42, 67 (1996); Eva Axen, FEBS Lett. 

375, 277 (1995); M. Burgos-Trinidad, R. Ismail, R. A. Ettinger, J. M. Prahl, H. F. DeLuca, J. 

Biol. Chem, 267, 3498 (1992); M. Wamer et al, J. Biol. Chem. 257, 12995 (1982)). 

20 

Example 2 

Detection of in vivo activity of CYPIAD 

To confirm that CYPIAD has abihty to activate the transactivating function of VDR 

by converting 25(OH)D3 into active vitamin D in vivo, COS-1 cells were co-transformed 
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with 0.5 lag of GAL4-VDR(DEF) expression vector, 1 ^tg of 17M2-G-CAT (S. Kato et al, 

Science 270, 1491 (1995)), 0.5 )ag each of ADX expression vector and ADR expression 

vector (T. Sakaki, S. Kominami, K. Hayashi, M. Akiyoshi-Shibata, Y. Yabusaki, J. Biol. 

Chem, 271, 26209 (1996); F. J. Dilworth et al, J. Biol. Chem. 270, 16766 (1995)), and 1 ^ig 

5 of CYPl AD expression vector, in the presence of 25(OH)D3 or la,25(OH)2D3. A 

representative CAT assay is shown at the bottom panel of Figure 7. The relative CAT 

activities are shown at the top panel of Figure 7, as the average and SEM of three 

independent experiments. 25(OH)D3 activated the CAT reporter gene when CYPl AD was 

expressed, while only la,25(OH)2D3 activated the reporter gene without using CYPl AD 

5 10 expression vector. However, 25(OH)D3 did not significantly activate the reporter gene in the 

S absence of ADX or ADR. These results strongly suggest that CYPIAD is la (OH)-ase, 

'■'^ 

2 which converts 25(OHp3 into la,25(OH)2D3. 

j i Example 3 

2 15 Chemical analysis of CYPIAD products 

To chemically determine the enzyme product of CYPIAD, normal phase HPLC and 

reversed phase HPLC were performed (E. B. Mawer et al., J. Clin. Endocrinol. Metab. 79, 

554 (1994); H. Fujii et al., EMBO J., in press (1997)). The cells (5x10^) transformed with 

ADR expression vector, ADX expression vector and CYPIAD expression vector (Figure 

20 8(b)), or the cells (5x10^) not transformed (Figure 8(c)) were incubated in the presence of 

[^H]25(OH)D3 (10^ dpm; 6.66 terabecquerel/nraiol, Amersham International) at 37°C for 

6 hours. The culture media were extracted with chloroform, and the extract was analyzed by 

normal phase HPLC using TSK-gel siUca 150 column (4.6x250mm, Tosoh), with 

hexane/isopropanol/methanol (88:6:6) for mobile phase, at the flow rate of 1.0 ml/min. The 
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eluate was collected and its radioactivity was measured using a liquid scintillation coimter (E. 
B. Mawer et al., J. Clin. Endocrinol. Metab. 79, 554 (1994); H. Fujii et al, EMBO J. in press, 
(1997)). The standard samples of vitamin D derivatives, namely, la,(0H)D3, 25(OH)D3, 
24,25(OH)2D3, la,25(OH)2D3 and la,24,25(OH)3D3, were applied to chromatography to 
determine their retention time by UV absorbance at 264 nm (Figure 8(a)). 

Likewise, reverse phase HPLC was performed with a colunm filled with Cosmasil 
5C18-AR (4.6x150 mm Nacalai Tesque) at flow rate of 1,0 ml/min to confirm the existence 
of [^H] la,25(OH)2D3. The chromatograms of standard samples for vitamin D derivatives, 
and the reaction product in the presence or absence of CYPIAD, are shown in Figure 9(a), 
(b), and (c), respectively. 

The retention times of enzyme products in normal phase HPLC and reverse phase 
HPLC were completely identical to that of sample, la,25(OH)2D3 standard. The results 
indicate that the cDNA of CYPIAD encodes mouse la (OH)-ase, which hydroxylates 
25(OH)D3 to la,25(OH)2D3. 

Example 4 

Analysis of tissue distribution of CYPIAD transcripts 

The tissue distribution of CYPIAD transcripts in 7-week-old normal and VDR-A 

mice was examined. Poly(A)'^ RNA was extracted fi:om brain, lung, heart, liver, spleen, 

kidney, small intestine, skeletal muscle, skin, and bone, and analyzed by northern blot 

technique using cDNA of CYPIAD and p-actin as probes (K. Takeyama et al., Biochem. 

Biophys. Res. Commun. 222, 395 (1996); H. Mano et al., J. Biol. Chem. 269, 1591 (1994)). 

As the result, the transcript of CYPIAD was detected as a single band in the kidney. The size 

of the transcript (2.4 kbp) is identical to that of cloned cDNA (Figure 10). Except for kidney, 

28 



Patent Application 
Attorney Docket No. 06501/054001 

in, la (OH)-ase activity has been reported in other tissues than kidney (A. W. Norman, J. 

Roth, L. Orchi, Endocr. Rev. 3, 331 (1982); H. F. DeLuca, Adv. Exp. Med. Biol. 196, 361 

(1986); M. R. Walters, Endocr. Rev 13, 719 (1992); G. A. Howard, R. T. Turner, D. J. 

Sherrard, D. J. Baylink, J. Biol. Chem. 256, 7738 (1981); T. K. Gray, G. E. Lester, R. S. 

Lorenc, Science 204, 1311 (1979)). However, the transcript of la (OH)-ase was not detected 

in tissues other than kidney in this experiment. 

The northern blot analysis of the expression of the CYPl AD gene and the 

24(OH)-ase (CYP24) gene was performed in 3- and 7-week-old VDR+/+, VDR+/-, and 

VDR-A mice, with (+) or without (-) administration of excess la,25(OH)2D3 (50 ng/mouse). 

A representative northern blot analysis is shown in Figure 11, The relative amount of the 

hydroxylase gene standardized with the p-actin gene transcripts was measured in at least 5 

mice for each group (Figure 12), Interestingly, the marked induction of the gene was seen in 

VDR-A mice (2.5 and 50 times in 3- and 7-week-old mice, respectively)(Figure 1 1, 12). In 

VDR+/H- mice and VDR+/- mice, the administration of la,25(OH)2D3 significantly inhibited 

expression of the la (OH)-ase gene, whereas the inhibition did not occurred in 3- and 7- 

week-old VDR-/- mice. Therefore, the overexpression of la (OH)-ase appears to cause raise 

in the serum level of la,25(OH)2D3 in 7-week-old VDR-/- mice compared with the normal 

level (Figure 2). Considering these results, it can be considered that Ugand-bound VDR is 

involved in the negative regulation of the la (OH)-ase gene expression by la,25(OH)2D3. In 

VDR-/- mice, the expression of the 24(OH)-ase gene was decreased to the imdetectable level, 

and the reaction against la,25(OH)2D3 was not seen (Figure 11,12). The 24(OH)-ase gene 

converts 25(OH)D3 to 24,25(OH)2D3, which is an inactive form of vitamin D, and its gene 

expression is positively regulated by la,25(OH)2D3. These results confirmed that the ligand- 

bound VDR is involved in the gene expression induced by la,25(OH)2D3 through vitamin D 
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responsive element in the promoter of the 24(OH)-ase gene (C. Zierold, H. M. Darwish, H. F. 
DeLuca, J. Biol, Chem. 270, 1675 (1995); Y, Ohyama et al., J. Biol. Chem, 269, 10545 
(1994)). Therefore, the Ugand-bound VDR adversely regulates the expression of la (OH)- 
ase and 24(OH)-ase genes by la,25(OH)2D3. 

Example 5 

Isolation of human gene encoding an enzyme that hydroxylates the 1 
position of vitamin D 

A normal human kidney cDNA library was prepared by extracting poly(A) RNA 
from normal human kidney tissue using the SacII(500bp)-Eco-RI(1200bp) fragment of 
mouse la (OH)-ase as a probe and inserting the RNA into ?i-ZAPIL A human gene 
encoding the enzyme that hydroxylates la position of vitamin D was obtained by screening 
the library prepared above by plaque hybridization method. The nucleotide sequence of the 
isolated gene is shown in SEQ ID NO: 4, and the putative amino acid sequence is shown in 
SEQ ID NO: 2. 

Industrial Applicability 

The present invention provides a method for screening genes encoding polypeptides 
capable of converting a ligand precursor into a ligand, and a method for determining whether 
or not a test gene encodes a polypeptide that converts a ligand precursor into a ligand. The 
method of the present invention, xmlike the existing expression cloning method, 
advantageously utilizes the nature of nuclear receptors that regulate transcription by being 
bound by a hgand. Since a desired gene can be detected by the reporter activity, the method 
of the invention enables simply and efficiently detecting and isolating a gene even if it 
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encodes a polypeptide that is expressed at a low level. The present invention also provides a 
polypeptide that converts a ligand precursor into a ligand, namely, a polypeptide that 
converts an inactive form of vitamin D3 into its active form and a gene encoding it, which are 
obtained by the screening method as described above. The polypeptide and gene of the 
present invention can be used for treating and/or preventing defects in la (OH)-ase or renal 
failure. The polypeptide of the present invention can also be used to produce active vitamin 
D derivatives, namely, hydroxylate la position of vitamin D or its derivatives without a 
hydroxyl group at la position. The antibodies against the polypeptide of the present 
invention can be used to purify the polypeptide of the present invention, and to treat vitamin 
D excessiveness, granulomatous diseases, lymphoma, and the like. 

In addition, the present invention provides a method for screening ligands that bind to 
nuclear receptors, and a method for determining whether or not a test compound is a ligand 
of the nuclear receptor. The method also takes advantage of the nature of nuclear receptors 
and uses the reporter activity for the detection. These methods are thus simple and efficient 
as well as the method described above. For example, the method is useful in searching 
ligands for orphan receptors, for which ligands are unknown. 

Furthermore, the present invention provides a method for screening genes encoding 
polypeptides capable of converting an inactive form of transcriptional regulatory factor into 
an active form, based on the screening method described above. This method enables easily 
isolating genes that encode polypeptides capable of converting an inactive form of various 
transcriptional regulatory factors into the active form by detecting the reporter activity. 

What is claimed is: 
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1 . A cell comprising a vector carrying a gene encoding a nuclear receptor and a 
vector carrying the binding sequence of the nuclear receptor and a reporter gene located 
downstream of said binding sequence 

2. The cell of claim 1 , wherein the nuclear receptor is a vitamin D receptor. 

3. A cell comprising a vector carrying a gene encoding a fusion polypeptide 
comprising DNA binding domain of a nuclear receptor and ligand-binding domain of a 
nuclear receptor, and a vector carrying the binding sequence of the DNA binding domain of 
the nuclear receptor and a reporter gene located downstream of the binding sequence. 

4. The cell of claim 3, wherein the DNA binding domain of the nuclear receptor 
is originated from GAL4. 

5. The cell of claim 3, wherein the ligand-binding domain of the nuclear receptor 
is originated from vitamin D receptor. 

6. A method for screening a ligand that binds to a nuclear receptor, the method 
comprising 

(A) contacting a test compound with the cell of claim 1, 

(B) detecting the reporter activity, and 

(C) selecting the test compound which ehcited the reporter activity in the cell. 
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1 7. A method for determining whether or not a test compound is a ligand that 

2 binds to a nuclear receptor, the method comprising, 

3 (A) contacting a test compound with the cell of claim 1, and 

4 (B) detecting the reporter activity. 

1 8. A method for screening a gene encoding a polypeptide that converts a ligand 

2 precursor into a ligand, the method comprising 

3 (A) introducing a test gene into the cell of claim 1, 

4 (B) contacting a Hgand precursor to the cell into which the test gene is 

5 introduced, 

6 (C) detecting the reporter activity, and 

7 (D) isolating the test gene from the cell which showed the reporter activity. 

1 9. A method for determining whether or not a test gene encoding a polypeptide 

2 that converts a ligand precursor into a ligand, the method comprising 

3 (A) introducing a test gene into the cell of claim 1 , 

4 (B) contacting a ligand precursor to the cell into which the test gene is 

5 introduced, and 

6 (C) detecting the reporter activity. 
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10. A method for screening a gene encoding a polypeptide that converts an 
inactive form of vitamin D3 into an active form, the method comprising 
(A) introducing a test gene into the cell of claim 2, 
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4 (B) contacting an inactive form of vitamin D3 to the cell into which the test gene 

5 is introduced, 

6 (C) detecting the reporter activity, and 

7 (D) isolating the test gene from the cell that shows the reporter activity. 

1 1 1 . A method for determining whether or not a test gene encodes a polypeptide 

2 that converts an inactive form of vitamin D3 into an active form, the method comprising 

3 (A) introducing a test gene into the cell of claim 2, 

4 (B) contacting an inactive form of vitamin D3 with the cell into which the test 

5 gene is introduced, and 

6 (C) detecting the reporter activity. 

1 12. A ligand that binds to a nuclear receptor, which is obtainable by the method of 

2 claim 6. 

1 13. A gene encoding a polypeptide that converts a ligand precursor into a ligand, 

2 which is obtainable by the method of claim 8. 

1 14, A gene encoding a polypeptide that converts an inactive form of vitamin D3 

2 into an active form, which is obtainable by the method of claim 1 0. 

1 1 5 . A polypeptide comprising the amino acid sequence of SEQ ID NO : 1 or its 

2 derivative comprising said sequence in which one or more amino acids are substituted, 
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3 deleted, or added, and having activity to convert an inactive form of vitamin D3 into an active 

4 form. 

1 16. A polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or its 



2 derivative comprising said sequence in which one or more amino acids are substituted, 

3 deleted, or added, and having activity to convert an inactive form of vitamin D3 into an active 

4 form. 

1 17. A polypeptide encoded by a DNA that hybridizes with a DNA having the 

2 nucleotide sequence of SEQ ID NO: 3, wherein the polypeptide has activity to convert an 

3 inactive form of vitamin D3 into an active form. 

1 1 8. A polypeptide encoded by a DNA that hybridizes with the nucleotide 

2 sequence of SEQ ID NO: 4, wherein the polypeptide has activity to convert an inactive form 

3 of vitamin D3 into an active form. 

1 1 9. A DNA encoding the polypeptide of claim 15. 

1 20. A DNA hybridizing with a DNA having the nucleotide sequence of SEQ ID 

2 NO: 3 and encoding a polypeptide having activity to convert an inactive form of vitamin D3 

3 into an active form. 
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21. A DNA hybridizing with a DNA having the nucleotide sequence of SEQ ID 
NO: 4 and encoding a polypeptide having activity to convert an inactive form of vitamin D3 
into an active form. 

22. A vector comprising the DNA of claim 20. 

23 . A transformant expressively retaining the DNA of claim 20. 

24. A method for producing polypeptide, the method comprising culturing the 
transformant of claim 23. 

25 . An antibody that binds to the polypeptide of claim 1 5 . 

26. A method for screening a gene encoding a polypeptide that converts an 
inactive form of transcriptional regulatory factor into an active form, the method comprising 

(A) introducing a test gene into cells into which a vector comprising a gene 
encoding an inactive form of transcriptional regulatory factor and a vector comprising the 
binding sequence of said inactive transcriptional regulatory factor and a reporter gene located 
downstream thereof are introduced, 

(B) detecting the reporter activity, and 

(C) isolating the test gene from the cells showing the reporter activity. 
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1 27. The method of claim 26, wherein the inactive transcriptional regulatory factor 

2 is a complex of non-phosphorylated NFkB and IkB, non-phosphorylated HSTF, or non- 

3 phosphorylated AP 1 . 
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GENE SCREENING METHOD USING NUCLEAR RECEPTOR 

Abstract of the Disclosure 
A system in which a ligand is formed by the expression of a polypeptide that 
converts a ligand precursor into a ligand, and the hgand thus formed binds to a nuclear 
receptor to thereby induce the expression of a reporter gene located downstream of the 
target sequence is constructed. Searching a gene library using this system can isolate a 
gene encoding a polypeptide capable of converting a Ugand precursor into a hgand. This 
system, which takes the advantage of the transcriptional regulatory function of a nuclear 
receptor, enables screening a hgand that binds to a nuclear receptor and to examine whether 
or not a test compound is a hgand that binds to the nuclear receptor, and also screening 
genes that encode polypeptides capable of converting an inactive form of a wide range of 
transcriptional regulatory factors into an active form. 
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Sequence Listing 



<110> CHUGAI SEIYAKU KABUSHIKI KAISHA. 

<120> Gene screening method using nuclear receptor 

<130> C1-901PCT 

<140> 
<141> 

<150> JP 09/212624 
<151> 1997-7-22 

<160> 4 

<170> Patent In Ver. 2.0 

<210> 1 
<211> 507 
<212> RPT 

<213> Mus musculus 
<400> 1 



Ser Arg Val Phe His Arg lie His 

10 15 
Leu Gly Ser Arg Gly Ser GIu Ser 
25 30 
Pro Gly Pro Ser Thr Leu Ser Phe 
45 

Gly Leu Ser Arg Leu His Glu Leu 
60 

Gly Pro lie Trp Ser Gly Ser Phe 
75 80 



Met Thr Gin Ala Val Lys Leu Ala 
1 5 
Leu Pro Leu Gin Leu Asp Ala Ser 
20 

Val Leu Arg Ser Leu Ser Asp He 
35 40 
Leu Ala GIu Leu Phe Cys Lys Gly 

50 55 
Gin Val His Gly Ala Ala Arg Tyr 
65 70 
Gly Thr Leu Arg Thr Val Tyr Val 
85 



2/12 



Ala Asp Pro Thr Leu Val Glu Gin Leu Leu Arg Gin Glu Ser His Cys 

90 95 100 

Pro Glu Arg Cys Ser Phe Ser Ser Trp Ala Glu His Arg Arg Arg His 
105 110 115 120 

Gin Arg Ala Cys Gly Leu Leu Thr Ala Asp Gly Glu Glu Trp Gin Arg 

125 130 135 

Leu Arg Ser Leu Leu Ala Pro Leu Leu Leu Arg Pro Gin Ala Ala Ala 

140 145 150 

Gly Tyr Ala Gly Thr Leu Asp Asn Val Val Arg Asp Leu Val Arg Arg 

155 160 165 

Leu Arg Arg Gin Arg Gly Arg Gly Ser Gly Leu Pro Gly Leu Val Leu 

170 175 180 

Asp Val Ala Gly Glu Phe Tyr Lys Phe Gly Leu Glu Ser lie Gly Ala 
185 190 195 200 

Val Leu Leu Gly Ser Arg Leu Gly Cys Leu Glu Ala Glu Val Pro Pro 

205 210 215 

Asp Thr Glu Thr Phe He His Ala Val Gly Ser Val Phe Val Ser Thr 

220 225 230 

Leu Leu Thr Met Ala Met Pro Asn Trp Leu His His Leu lie Pro Gly 

235 240 245 

Pro Trp Ala Arg Leu Cys Arg Asp Trp Asp Gin Met Phe Ala Phe Ala 

250 255 260 

Gin Arg His Val Glu Leu Arg Glu Gly Glu Ala Ala Met Arg Asn Gin 
265 270 275 280 

Gly Lys Pro Glu Glu Asp Met Pro Ser Gly His His Leu Thr His Phe 

285 290 295 

Leu Phe Arg Glu Lys Val Ser Val Gin Ser lie Val Gly Asn Val Thr 

300 305 310 

Glu Leu Leu Leu Ala Gly Val Asp Thr Val Ser Asn Thr Leu Ser Trp 

315 320 325 

Thr Leu Tyr Glu Leu Ser Arg His Pro Asp Val Gin Thr Ala Leu His 

330 335 340 

Ser Glu lie Thr Ala Gly Thr Arg Gly Ser Cys Ala His Pro His Gly 
345 350 355 360 

Thr Ala Leu Ser Gin Leu Pro Leu Leu Lys Ala Val He Lys Glu Val 
365 370 375 
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Leu Arg Leu Tyr Pro Val Val Pro Gly Asn Ser Arg Val Pro Asp Arg 

380 385 390 

Asp lie Arg Val Gly Asn Tyr Val He Pro Gin Asp Thr Leu Val Ser 

395 400 405 

Leu Cys His Tyr Ala Thr Ser Arg Asp Pro Thr Gin Phe Pro Asp Pro 

410 415 420 

Asn Ser Phe Asn Pro Ala Arg Trp Leu Gly Glu Gly Pro Thr Pro His 
425 430 435 440 

Pro Phe Ala Ser Leu Pro Phe Gly Phe Gly Lys Arg Ser Cys lie Gly 

445 450 455 

Arg Arg Leu Ala Glu Leu Glu Leu Gin Met Ala Leu Ser Gin lie Leu 

460 465 470 

Thr His Phe Glu Val Leu Pro Glu Pro Gly Ala Leu Pro lie Lys Pro 

475 480 485 

Met Thr Arg Thr Val Leu Val Pro Glu Arg Ser He Asn Leu Gin Phe 

490 495 500 

Val Asp Arg 
505 



<210> 2 
<211> 508 
<212> RPT 
<213> Homo sapiens 



<400> 2 

Met Thr Gin Thr Leu Lys Tyr Ala 
1 5 
Trp Ala Pro Glu Leu Gly Ala Ser 
20 

Ala Arg Arg Ser Leu Ala Asp He 
35 40 
Leu Ala Glu Leu Phe Cys Lys Gly 

50 55 
Gin Val Gin Gly Ala Ala His Phe 
65 70 
Gly Thr Val Arg Thr Val Tyr Val 



Ser Arg Val Phe His Arg Val Arg 

10 15 
Leu Gly Tyr Arg Glu Tyr His Ser 
25 30 
Pro Gly Pro Ser Thr Pro Ser Phe 
45 

Gly Leu Ser Arg Leu His Glu Leu 
60 

Gly Pro Val Trp Leu Ala Ser Phe 
75 80 
Ala Ala Pro Ala Leu Val Glu Glu 
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85 90 95 

Leu Leu Arg Gin Glu Gly Pro Arg Pro Glu Arg Cys Ser Phe Ser Pro 

100 105 110 

Trp Thr Glu His Arg Arg Cys Arg Gin Arg Ala Cys Gly Leu Leu Thr 

115 120 125 

Ala Glu Gly Glu Glu Trp Gin Arg Leu Arg Ser Leu Leu Ala Pro Leu 

130 135 140 

Leu Leu Arg Pro Gin Ala Ala Ala Arg Tyr Ala Gly Thr Leu Asn Asn 
145 150 155 160 

Val Val Cys Asp Leu Val Arg Arg Leu Arg Arg Gin Arg Gly Arg Gly 

165 170 175 

Thr Gly Pro Pro Ala Leu Val Arg Asp Val Ala Gly Glu Phe Tyr Lys 

180 185 190 

Phe Gly Leu Glu Gly He Ala Ala Val Leu Leu Gly Ser Arg Leu Gly 

195 200 205 

Cys Leu Glu Ala Gin Val Pro Pro Asp Thr Glu Thr Phe He Arg Ala 

210 215 220 

Val Gly Ser Val Phe Val Ser Thr Leu Leu Thr Met Ala Met Pro His 
225 230 235 240 

Trp Leu Arg His Leu Val Pro Gly Pro Trp Gly Arg Leu Cys Arg Asp 

245 250 255 

Trp Asp Gin Met Phe Ala Phe Ala Gin Arg His Val Glu Arg Arg Glu 

260 265 270 

Ala Glu Ala Ala Met Arg Asn Gly Gly Gin Pro Glu Lys Asp Leu Glu 

275 280 285 

Ser Gly Ala His Leu Thr His Phe Leu Phe Arg Glu Glu Leu Pro Ala 

290 295 300 

Gin Ser He Leu Gly Asn Val Thr Glu Leu Leu Leu Ala Gly Val Asp 
305 310 315 320 

Thr Val Ser Asn Thr Leu Ser Trp Ala Leu Tyr Glu Leu Ser Arg His 

325 330 335 

Pro Glu Val Gin Thr Ala Leu His Ser Glu He Thr Ala Ala Leu Ser 

340 345 350 

Pro Gly Ser Ser Ala Tyr Pro Ser Ala Thr Val Leu Ser Gin Leu Pro 

355 360 365 

Leu Leu Lys Ala Val Val Lys Glu Val Leu Arg Leu Tyr Pro Val Val 
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370 375 380 

Pro Gly Asn Ser Arg Val Pro Asp Lys Asp He His Val Gly Asp Tyr 
385 390 395 400 

lie He Pro Lys Asn Thr Leu Val Thr Leu Cys His Tyr Ala Thr Ser 

405 410 415 

Arg Asp Pro Ala Gin Phe Pro Glu Pro Asn Ser Phe Arg Pro Ala Arg 

420 425 430 

Trp Leu Gly Glu Gly Pro Thr Pro His Pro Phe Ala Ser Leu Pro Phe 

435 440 445 

Gly Phe Gly Lys Arg Ser Cys Met Gly Arg Arg Leu Ala Glu Leu Glu 

450 455 460 

Leu Gin Met Ala Leu Ala Gin lie Leu Thr His Phe Glu Val Gin Pro 
465 470 475 480 

Glu Pro Gly Ala Ala Pro Val Arg Pro Lys Thr Arg Thr Val Leu Val 

485 490 495 

Pro Glu Arg Ser lie Asn Leu Gin Phe Leu Asp Arg 
500 505 



<210> 3 
<211> 2386 
<212> DNA 
<213> Mus musculus 

<220> 
<221> CDS 

<222> (30).., (1550) 



<400> 3 

ctctcgaagc agactcccca aacacagac atg acc cag gca gtc aag etc gcc 53 

Met Thr Gin Ala Val Lys Leu Ala 
1 5 

tec aga gtt ttt cac cga ate cac ctg cct etg cag ctg gat gcc teg 101 

Ser Arg Val Phe His Arg He His Leu Pro Leu Gin Leu Asp Ala Ser 

10 15 20 

ctg ggc tec aga ggc agt gag teg gtt etc egg age ttg tet gae ate 149 

Leu Gly Ser Arg Gly Ser Glu Ser Val Leu Arg Ser Leu Ser Asp lie 
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25 

cct ggg 
Pro Gly 

ggg ctg 
Gly Leu 

ggg cca 
Gly Pro 

gcc gac 
Ala Asp 
90 

cca gag 
Pro Glu 
105 

cag cgt 
Gin Arg 

etc cga 
Leu Arg 

ggc tat 
Gly Tyr 

eta agg 
Leu Arg 
170 
gac gtg 
Asp Val 
185 

gtg ctg 
Val Leu 

gac aca 
Asp Thr 



ccc tct 
Pro Ser 

tec agg 
Ser Arg 
60 

ata tgg 
He Trp 
75 

cct aca 
Pro Thr 

cgc tgt 
Arg Cys 

get tgc 
Ala Cys 

agt ett 
Ser Leu 
140 
get gga 
Ala Gly 
155 

cgc cag 
Arg Gin 

gca gga 
Ala Gly 

ctg gga 
Leu Gly 

gaa ace 
Glu Thr 



30 

aca etc 
Thr Leu 
45 

ctg eat 
Leu His 

tct ggc 
Ser Gly 

ett gtg 
Leu Val 

agt ttc 
Ser Phe 
110 
gga ttg 
Gly Leu 
125 

ctg gcc 
Leu Ala 

act ctg 
Thr Leu 

egg gga 
Arg Gly 

gag ttt 
Glu Phe 
190 
teg cgc 
Ser Arg 
205 

ttc ata 
Phe He 



age ttc 
Ser Phe 

gaa ctg 
Glu Leu 

age ttt 
Ser Phe 
80 

gag cag 
Glu Gin 
95 

tea tea 
Ser Ser 

eta acg 
Leu Thr 

eeg etc 
Pro Leu 

gac aac 
Asp Asn 
160 
cgt ggc 
Arg Gly 
175 

tac aaa 
Tyr Lys 



ctg get 
Leu Ala 
50 

cag gtg 
Gin Val 
65 

ggg aca 
Gly Thr 

etc ctg 
Leu Leu 

tgg gca 
Trp Ala 

geg gat 
Ala Asp 
130 
etc etc 
Leu Leu 
145 

gtg gte 
Val Val 

tct ggg 
Ser Gly 

ttt ggc 
Phe Gly 



ctg ggc tgc eta 
Leu Gly Cys Leu 
210 

eat gca gtg ggc 
His Ala Val Gly 



35 

gaa etc 
Glu Leu 

cat ggc 
His Gly 

ett cgc 
Leu Arg 

cga caa 
Arg Gin 
100 
gag eac 
Glu His 
115 

ggt gaa 
Gly Glu 

egg cca 
Arg Pro 

cgt gac 
Arg Asp 

eta ccc 
Leu Pro 
180 
eta gaa 
Leu Glu 
195 

gag get 
Glu Ala 



ttc tgc 
Phe Cys 

get geg 
Ala Ala 
70 

aca gtt 
Thr Val 
85 

gaa agt 
Glu Ser 

cgt cgc 
Arg Arg 

gaa tgg 
Glu Trp 

caa gca 
Gin Ala 
150 
ett gtg 
Leu Val 
165 

ggc eta 
Gly Leu 



40 
aaa ggg 
Lys Gly 
55 

egg tac 
Arg Tyr 

tac gtt 
Tyr Val 

eac tgt 
His Cys 

cgc eac 
Arg His 
120 
cag agg 
Gin Arg 
135 

gee geg 
Ala Ala 

cga cga 
Arg Arg 

gtt ctg 
Val Leu 



tea gtg 
Ser Val 



agt ata ggc geg 
Ser He Gly Ala 
200 

gaa gtc cct cct 
Glu Val Pro Pro 
215 

ttt gtg tct aca 
Phe Val Ser Thr 



197 



245 



293 



341 



389 



437 



485 



533 



581 



629 



677 



725 
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etc ttg 
Leu Leu 

ccc tgg 
Pro Trp 
250 
cag agg 
Gin Arg 
265 

gga aag 
Gly Lys 

ctt ttt 
Leu Phe 

gag eta 
Glu Leu 



aca etc 
Ihr leu 

330 
tct gag 
Ser Glu 
345 

act get 
Thr Ala 

ttg aga 
Leu Arg 

gac ate 
Asp lie 

eta tgt 
Leu Cys 



220 
ace atg 
Thr Met 
235 

gee cge 
Ala Arg 

cac gtg 
His Val 

cct gag 
Pro Glu 

egg gaa 
Arg Glu 
300 
eta ctg 
Leu Leu 
315 

tat gag 
Tyr Glu 



ate aca 
lie Thr 

ctg tec 
Leu Ser 



ttg tae 
Leu Tyr 
380 
cgt gta 
Arg Val 
395 

cac tat 
His Tyr 



gcg atg 
Ala Met 

etc tgc 
Leu Cys 

gag ctg 
Glu Leu 
270 
gag gat 
Glu Asp 
285 

aag gtg 
Lys Val 

get gga 
Ala Gly 

ctt tec 
Leu Ser 

get ggg 
Ala Gly 
350 
cag ctg 
Gin Leu 
365 

cct gtg 
Pro Val 



ccc aac 
Pro Asn 
240 
cga gac 
Arg Asp 
255 

cga gaa 
Arg Glu 

atg ccg 
Met Pro 

tct gtc 
Ser Val 

gtg gac 
Val Asp 
320 
egg cac 
Arg His 
335 

ace cgt 
Thr Arg 

ccc ctg 
Pro Leu 

gta cct 
Val Pro 



225 

tgg ttg cac cac 
Trp Leu His His 



gga aac tat gta 
Gly Asn Tyr Val 
400 

gcc act tea agg 
Ala Thr Ser Arg 



tgg gat 
Trp Asp 

ggt gaa 
Gly Glu 

tct ggg 
Ser Gly 
290 
cag tec 
Gin Ser 
305 

acg gta 
Thr Val 

ccc gat 
Pro Asp 

ggc tec 
Gly Ser 

tta aag 
Leu Lys 
370 
ggg aat 
Gly Asn 
385 

att ccc 
lie Pro 



cag atg 
Gin Met 
260 
get gcg 
Ala Ala 
275 

cat cac 
His His 

ata gtg 
He Val 

tec aat 
Ser Asn 

gtc cag 
Val Gin 
340 
tgt gee 
Cys Ala 
355 

get gtg 
Ala Val 

tee cgt 
Ser Arg 

caa gat 
Gin Asp 



230 

ctt ata cct gga 
Leu He Pro Gly 
245 

ttt gcc ttt gee 
Phe Ala Phe Ala 



gac ccc aca cag 
Asp Pro Thr Gin 



atg agg 
Met Arg 

tta ace 
Leu Thr 

ggg aat 
Gly Asn 
310 
acg etc 
Thr Leu 
325 

act gca 
Thr Ala 

cac ccc 
His Pro 

ate aaa 
He Lys 

gtc eca 
Val Pro 
390 
acg eta 
Thr Leu 
405 

ttt eca 
Phe Pro 



aac cag 
Asn Gin 
280 
cac tte 
His Phe 
295 

gtg aca 
Val Thr 



tec tgg 
Ser Trp 

etc cac 
Leu His 

cat ggc 
His Gly 
360 
gaa gtg 
Glu Val 
375 

gac aga 
Asp Arg 

gtc tec 
Val Ser 

gac ccc 
Asp Pro 



773 



821 



869 



917 



965 



1013 



1061 



1109 



1157 



1205 



1253 



1301 



- 7 



410 415 420 

aac tct ttt aat cca get cgc tgg ctg ggg gag ggt ccg acc ccc cac 1349 
Asn Ser Phe Asn Pro Ala Arg Trp Leu Gly Glu Gly Pro Thr Pro His 
425 430 435 440 

cca ttt gca tct ctt ccc ttc ggc ttt ggc aaa egg age tgc ate ggg 1397 
Pro Phe Ala Ser Leu Pro Phe Gly Phe Gly Lys Arg Ser Cys lie Gly 

445 450 455 

aga cgc ttg gca gag ctt gag eta caa atg get ttg tee cag ate ttg 1445 
Arg Arg Leu Ala Glu Leu Glu Leu Gin Met Ala Leu Ser Gla He Leu 

460 465 470 

acc cat ttt gaa gtg eta ect gag cca ggt get ctt cct ate aag ccc 1493 
Thr His Phe Glu Val Leu Pro Glu Pro Gly Ala Leu Pro He Lys Pro 

475 480 485 

atg acc egg act gtc ctg gtc ect gag agg age ate aat eta cag ttt 1541 
Met Thr Arg Thr Val Leu Val Pro Glu Arg Ser He Asa Leu Gin Phe 

490 495 500 

gta gat aga taaccatteg gaagacagcc aacatcgtet ctctcaagac 1590 
Val Asp Arg 
505 

aggatggggt ctttgttata cacaagaggc acactctcct tggaggcetg tctgaccgag 1650 
caaactccag gaagcaggtc ctgacetatg tgtacttggc ctgactcagc aggcatcgca 1710 
gaaccaccat ctttctcctt cctgctcagt gcctctcetg ateattcctc aggatccaat 1770 
gccttcagat tttaacacat ecttaaagtg ecaacgcagg ggttaactae caactceagg 1830 
eagcctgggg agggattcgc ccctgatcct gtagtgttcg ttgatgctct gtctaagcat 1890 
ttatcacgge acaagetaag tgattgcatc tggtctgcac ctggctgcat ctctacctga 1950 
ccatgtgtgt gccttctgag aagagtaatg actagtctac tgggctttta gctctttttc 2010 
tttttgagac agagtcttgc tatgtattcc atgctgtcct ggaaatteae aacttccttg 2070 
cetcaccttt cccaagtatt gggttacaga cttgagctac cacttccagc tgtatcagtc 2130 
tttatatetc ctgceagagt etatcccttg gttatttcag caeeatacat ttctcagact 2190 
gaacctggac catgtggcag gatcgtccac tcaccaggct ctgcccaccc tttttctctc 2250 
ttaatctttc ctctagggaa gtaaatctgc ccttgcctaa tttacagcgt ttttaagcct 2310 
ccgctacctt ggttcttcag ccactctcaa gtggatccac tttcttatca tccatgttta 2370 
ggectgceet tcteca 2386 



<210> 4 
<2ll> 2362 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (!)...( 1524) 

<400> 4 

atg acc cag acc etc aag tac gcc tec aga gtg tte cat ege gtc cgc 48 
Met Thr Gin Thr Leu Lys Tyr Ala Ser Arg Val Phe His Arg Val Arg 
15 10 15 

tgg gcg ccc gag ttg ggc gcc tec eta ggc tac cga gag tac eac tea 96 
Trp Ala Pro Glu Leu Gly Ala Ser Leu Gly Tyr Arg Glu Tyr His Ser 

20 25 30 

gca cgc egg age ttg gea gac ate cea ggc cec tct acg ccc age ttt 144 
Ala Arg Arg Ser Leu Ala Asp He Pro Gly Pro Ser Thr Pro Ser Phe 

35 40 45 

etg gee gaa ctt tte tge aag ggg ggg ctg teg agg eta eac gag ctg 192 
Leu Ala Glu Leu Phe Cys Lys Gly Gly Leu Ser Arg Leu His Glu Leu 

SO 55 60 

cag gts czg ggc gee gcg eac tte ggg ecg gtg tgg eta gee age ttt 240 
Gin Val Gin Gly Ala Ala His Phe Gly Pro Val' Trp Leu Ala Ser Phe 
65 70 75 80 

ggg aca gtg cgc aec gtg tac gtg get gcc cct gea etc gte gag gag 288 
Gly Thr Val Arg Thr Val Tyr Val Ala Ala Pro Ala Leu Val Glu Glu 

85 90 95 

ctg etg cga cag gag gga ecc egg ecc gag cgc tgc age tte teg ccc 336 
Leu Leu Arg Gin Glu Gly Pro Arg Pro Glu Arg Cys Ser Phe Ser Pro 

100 105 110 

tgg acg gag cae ege cgc tgc cgc cag egg get tgc gga ctg etc act 384 
Trp Thr Glu His Arg Arg Cys Arg Gin Arg Ala Cys Gly Leu Leu Thr 

115 120 125 

gcg gaa ggc gaa gaa tgg caa agg etc cgc agt etc ctg gcc ccg etc 432 
Ala Glu Gly Glu Glu Trp Gin Arg Leu Arg Ser Leu Leu Ala Pro Leu 

130 135 140 

etc etc egg cct caa gcg gcc gcc cgc tac gee gga acc ctg aac aac 480 



7- 
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Leu Leu 
145 

gta gtc 
Val Val 

acg ggg 
Thr Gly 

ttc gga 
Phe Gly 

tgc ctg 
Cys Leu 
210 
gtg ggc 
Val Gly 
225 

tgg ctg 
Trp Leu 

tgg m 
Trp Asp 

gca gag 
Ala Glu 

tct ggg 
Ser Gly 
290 
cag tec 
Gin Ser 
305 

acg gtg 
Thr Val 

ccc gaa 



Arg Pro Gin Ala 
150 
ctt gtg 
Leu Val 
165 

gee ctg 
Ala Leu 



tgc gac 
Cys Asp 



ccg ccc 
Pro Pro 
180 
ctg gaa 
Leu Glu 
195 

gag get 
Glu Ala 



Ala Ala Arg Tyr Ala 
155 
agg cgc 
Arg Arg 
170 

gtg gcg 
Val Ala 



egg cgt ctg 
Arg Arg Leu 



teg gtg 
Ser Val 

cgc cac 
Arg His 

cag atg 
Gin Met 
260 
gca gee 
Ala Ala 
275 

gcg cac 
Ala His 



ggc ate 
Gly lie 

caa gtg 
Gin Val 

ttt gtg 
Phe Val 
230 
ctt gtg 
Leu Val 
245 

ttt gca 
Phe Ala 

atg agg 
Met Arg 

ctg ace 
Leu Thr 



ate ctg gga aat 
He Leu Gly Asn 
310 

tec aac aeg etc 
Ser Asn Thr Leu 
325 

gtc cag aca gca 



gtt egg gae 
Val Arg Asp 
185 

gee gcg gtt 
Ala Ala Val 

200 
cea ccc gac 
Pro Pro Asp 
215 

tec acg ctg 
Ser Thr Leu 

cet ggg ccc 
Pro Gly Pro 

ttt get cag 
Phe Ala Gin 
265 

aac gga gga 
Asn Gly Gly 

280 
cac ttc ctg 
His Phe Leu 
295 

gtg aca gag 
Val Thr Glu 



ctg etc 
Leu Leu 

aeg gag 
Thr Glu 

ttg acc 
Leu Thr 
235 
tgg ggc 
Trp Gly 
250 

agg cac 
Arg His 

cag ecc 
Gin Pro 

ttc egg 
Phe Arg 



Gly Thr Leu Asn Asn 
160 

cag egg gga cgt ggc 
Gin Arg Gly Arg Gly 
175 

ggg gaa ttt tac aag 
Gly Glu Phe Tyr Lys 
190 

cgc ttg ggc 
Arg Leu Gly 



ttg eta 
Leu Leu 
315 
ctg tat 
Leu Tyr 
330 

etc cac tea gag ate 



ggc teg 
Gly Ser 
205 
acc ttc 
Thr Phe 
220 

atg gcg 
Met Ala 

cgc etc 
Arg Leu 

gtg gag 
Val Glu 

gag aag 
Glu Lys 
285 
gaa gag 
Glu Glu 
300 

ttg gcg 
Leu Ala 



ate cgc get 
He Arg Ala 

atg ccc cac 
Met Pro His 
240 

tgc ega gae 
Cys Arg Asp 

255 
egg cga gag 
Arg Arg Glu 
270 

gac ctg gag 
Asp Leu Glu 

ttg cet gee 
Leu Pro Ala 



tct tgg get 
Ser Trp Ala 



gga gtg gac 
Gly Val Asp 
320 

gag etc tee egg cac 
Glu Leu Ser Arg His 
335 

aca get gee ctg age 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 
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1104 



Pro Glu Val Gin Thr Ala Leu His Ser Glu He Thr Ala Ala Leu Ser 

340 345 350 

cct ggc tec agt gcc tac ccc tea gee act gtt ctg tec cag ctg cec 
Pro Gly Ser Ser Ala Tyr Pro Ser Ala Thr Val Leu Ser Gin Leu Pro 

355 360 365 

ctg ctg aag gcg gtg gtc aag gaa gtg eta aga ctg tac cct gtg gta 1152 
Leu Leu Lys Ala Val Val Lys Glu Val Leu Arg Leu Tyr Pro Val Val 

370 375 380 

cct gga aat tct cgt gtc cea gac aaa gac att cat gtg ggt gac tat 1200 
Pro Gly Asn Ser Arg Val Pro Asp Lys Asp He His Val Gly Asp Tyr 
385 390 395 400 

att ate ecc aaa aat acg ctg gtc act ctg tgt cac tat gcc act tea 1248 
He He Pro Lys Asn Thr Leu Val Thr Leu Cys His Tyr Ala Thr Ser 

405 410 415 

agg gac cct gcc eag ttc cea gag cea aat tct ttt egt eea get cgc 
Arg Asp Pro Ala Gin Phe Pro Glu Pro Asn Ser Phe Arg Pro Ala Arg 

420 425 430 

tgg ctg ggg gag ggt ccc ace cce cae cea ttt gca tct ett cee ttt 
Trp Leu Gly Glu Gly Pro Thr Pro His Pro Phe Ala Ser Leu Pro Phe 

435 440 445 

£gc ttt ggc aag cgc age tgt atg ggg aga cgc ctg gca gag ctt gaa 
Gly Phe Gly Lys Arg Ser Cys Met Gly Arg Arg Leu Ala Glu Leu Glu 

450 455 460 

ttg caa atg get ttg gcc cag ate eta aea eat ttt gag gtg cag eet 
Leu Gin Met Ala Leu Ala Gin He Leu Thr His Phe Glu Val Gin Pro 
465 470 475 480 

gag cea ggt gcg gee eea gtt aga cec aag ace egg act gtc ctg gta 
Glu Pro Gly Ala Ala Pro Val Arg Pro Lys Thr Arg Thr Val Leu Val 

485 490 495 

eet gaa agg age ate aac eta cag ttt ttg gac aga tagtcccatg 1534 
Pro Glu Arg Ser lie Asn Leu Gin Phe Leu Asp Arg 

500 505 
gaaagagaet gtcatcatca ccettteatt cateataggg ataagatttt ttgtaggcae 1594 
aagaccaagg tatacatctt eccetaatgc ctatctgace aaaetggata gaaccaeeat 1654 
agtgaagtgt gaggeggcte tgaccaatgt gtgaagtatg eacttggeet gaeteaggaa 1714 
.gccaggtgag aaaaceatgg tetctctgct tgcttggcce ttetgateat gtatgcatcc 17Y4 



1296 



1344 



1392 



1440 



1488 
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cccaaggatg aaatcagatt ttaactaata atgctggatg gcctgaagga aagattcaac 1834 

tgcctctctt tttgggcttt catagtgttc attgatgctg ctggctrrgc atttgtcaaa 1894 

gcataagctc agtagctgtg catctggtct gnacctggtt ggtccttcgt ctttgcatgt 1954 

aagctctttg agaggaaggg tgaagtctta tttgtttttt atgtcccctg ccagggcctg 2014 

tctctgacta ggtgtcacca tacacattct tagattgaat ctgaaccatg tggcagaagg 2074 

gataagcagc ttacttagta ggctctgtct acccccttcc ttctttgtct tgcccctagg 2134 

aaggtgaatc tgccctagcc tggtttacgg tttcttataa ctctcctttg ctctctggcc 2194 

actattaggt gggtttgccc catcacttag ttctcaggca gagacatctt tgggcctgtc 2254 

cctgcccagg cctctggctt tttatattga aaatttttaa atattcacaa attttagaat 2314 

aaaccaaata ttccattctt aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2362 





